Improving the robustness of phonetic segmentation to accent and style variation with a two-staged approach
نویسندگان
چکیده
Correct and temporally accurate phonetic segmentation of speech utterances is important in applications ranging from transcription alignment to pronunciation error detection. Automatic speech recognizers used in these tasks provide insufficient temporal alignment accuracy apart from a recognition performance that is sensitive to accent and style variations from the training data. A two-staged approach combining HMM broad-class recognition with acousticphonetic knowledge based refinement is evaluated for phonetic segmentation accuracy in the context of accent and style mismatches with training data.
منابع مشابه
Lexical H*+L pitch accent in Ryukyuan: Diversities in phonological patterning and phonetic manifestation
Lexical pitch accent languages such as Swedish and Japanese have been claimed to exhibit variation in phonological inventory and/or phonetic manifestation of pitch accents. This paper reports variations in the phonetic manifestation as well as phonological patterning of the lexical H*+L pitch accent in two Ryukyuan dialects – Shuri and Nakijin. The F0 manifestation of H*+L pitch accent in the t...
متن کاملThe Role of Self-Regulatory Approach in Iranian Learners' Lexical Segmentation: The case of authentic materials
The present research investigated the effect of self-regulatory approach (with two components of self-checking and self-efficacy) on pre-intermediate Iranian learners' lexical segmentation in listening comprehension via authentic listening comprehension texts. To achieve this purpose, the investigators administered an Oxford Placement Test (2007) to ninety-eight students of two girls’ private j...
متن کاملThe Role of Self-Regulatory Approach in Iranian Learners' Lexical Segmentation: The case of authentic materials
The present research investigated the effect of self-regulatory approach (with two components of self-checking and self-efficacy) on pre-intermediate Iranian learners' lexical segmentation in listening comprehension via authentic listening comprehension texts. To achieve this purpose, the investigators administered an Oxford Placement Test (2007) to ninety-eight students of two girls’ private j...
متن کاملUsing accent information in ASR models for Swedish
A common technique to cope with the large variability in the acoustic realisations of the phonetic classes in speech, is to partition the data according to a linguistically significant variable. In this work, accent dependent phonetic models were trained and used both as an analysis tool for pronunciation variation and in the attempt to improve ASR performance. The Idea Accent dependent trainin...
متن کاملREGION MERGING STRATEGY FOR BRAIN MRI SEGMENTATION USING DEMPSTER-SHAFER THEORY
Detection of brain tissues using magnetic resonance imaging (MRI) is an active and challenging research area in computational neuroscience. Brain MRI artifacts lead to an uncertainty in pixel values. Therefore, brain MRI segmentation is a complicated concern which is tackled by a novel data fusion approach. The proposed algorithm has two main steps. In the first step the brain MRI is divided to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009